Search CORE

403 research outputs found

Federated PAC-Bayesian Learning on Non-IID data

Author: Ding Wenbo
Liu Yang
Zhang Xiao-Ping
Zhao Zihao
Publication venue
Publication date: 12/09/2023
Field of study

Existing research has either adapted the Probably Approximately Correct (PAC) Bayesian framework for federated learning (FL) or used information-theoretic PAC-Bayesian bounds while introducing their theorems, but few considering the non-IID challenges in FL. Our work presents the first non-vacuous federated PAC-Bayesian bound tailored for non-IID local data. This bound assumes unique prior knowledge for each client and variable aggregation weights. We also introduce an objective function and an innovative Gibbs-based algorithm for the optimization of the derived bound. The results are validated on real-world datasets

arXiv.org e-Print Archive

Growth of nonsymmetric operads

Author: Qi Zihao
Xu Yongjun
Zhang James J.
Zhao Xiangui
Publication venue
Publication date: 08/11/2020
Field of study

The paper concerns the Gelfand-Kirillov dimension and the generating series of nonsymmetric operads. An analogue of Bergman's gap theorem is proved, namely, no finitely generated locally finite nonsymmetric operad has Gelfand-Kirillov dimension strictly between

1

and

2

. For every

r\in \{0\}\cup \{1\}\cup [2,\infty)

r=\infty

, we construct a single-element generated nonsymmetric operad with Gelfand-Kirillov dimension

r

. We also provide counterexamples to two expectations of Khoroshkin and Piontkovski about the generating series of operads.Comment: 32 pages, 9 figure

arXiv.org e-Print Archive

ChatCAD: Interactive Computer-Aided Diagnosis on Medical Image using Large Language Models

Author: Ouyang Xi
Shen Dinggang
Wang Qian
Wang Sheng
Zhao Zihao
Publication venue
Publication date: 14/02/2023
Field of study

Large language models (LLMs) have recently demonstrated their potential in clinical applications, providing valuable medical knowledge and advice. For example, a large dialog LLM like ChatGPT has successfully passed part of the US medical licensing exam. However, LLMs currently have difficulty processing images, making it challenging to interpret information from medical images, which are rich in information that supports clinical decisions. On the other hand, computer-aided diagnosis (CAD) networks for medical images have seen significant success in the medical field by using advanced deep-learning algorithms to support clinical decision-making. This paper presents a method for integrating LLMs into medical-image CAD networks. The proposed framework uses LLMs to enhance the output of multiple CAD networks, such as diagnosis networks, lesion segmentation networks, and report generation networks, by summarizing and reorganizing the information presented in natural language text format. The goal is to merge the strengths of LLMs' medical domain knowledge and logical reasoning with the vision understanding capability of existing medical-image CAD models to create a more user-friendly and understandable system for patients compared to conventional CAD systems. In the future, LLM's medical knowledge can be also used to improve the performance of vision-based medical-image CAD models

arXiv.org e-Print Archive

Recordism: A social-scientific prospect of blockchain from social, legal, financial, and technological perspectives

Author: Fang Yang
Li Zihao
Xu Hao
Zhang Lei
Zhao Boyuan
Publication venue
Publication date: 02/04/2022
Field of study

Blockchain has the potential to reform the architecture of cyberspace and transform the storage, circulation and exchange of information through decentralization, transparency and de-identification. Meaning that ordinary participants can become traders, miners, retailers, and customers simultaneously, breaking the barriers and reducing the information gap between participants in the community, contributing to the futuristic metaverse with an open progressive and equal ideology. Such information transformation empowered by blockchain also profoundly impacts our methodological cognition, legal governance on cyberspace and financial and technological development. This study explores the main question: what are the implications of the blockchain-driven information revolution for society and social sciences? In order to answer this main question, this paper chooses four perspectives, which are methodological, legal, financial and technical. By analysis of these four perspectives, this paper is expected to provide a more comprehensive analysis of the blockchain-driven impact on society, social sciences, and technology to contribute to current scholarships. Additionally, regarding blockchain as an innovative methodological cognition, it grows on top of other technologies while helping advance other technologies. This paper concludes that although there are few frictions between blockchain and current social architecture, blockchain is so much more than the technology itself, that can be a representative of the community, acting as the source of trust, watcher of governance, law enforcer for virtual activities, and an incubator for future technologies

arXiv.org e-Print Archive

Neural Point Process for Learning Spatiotemporal Event Dynamics

Author: Rossi Ryan
Yang Xingyi
Yu Rose
Zhao Handong
Zhou Zihao
Publication venue
Publication date: 20/05/2022
Field of study

Learning the dynamics of spatiotemporal events is a fundamental problem. Neural point processes enhance the expressivity of point process models with deep neural networks. However, most existing methods only consider temporal dynamics without spatial modeling. We propose Deep Spatiotemporal Point Process (\ours{}), a deep dynamics model that integrates spatiotemporal point processes. Our method is flexible, efficient, and can accurately forecast irregularly sampled events over space and time. The key construction of our approach is the nonparametric space-time intensity function, governed by a latent process. The intensity function enjoys closed form integration for the density. The latent process captures the uncertainty of the event sequence. We use amortized variational inference to infer the latent process with deep networks. Using synthetic datasets, we validate our model can accurately learn the true intensity function. On real-world benchmark datasets, our model demonstrates superior performance over state-of-the-art baselines. Our code and data can be found at the https://github.com/Rose-STL-Lab/DeepSTPP

arXiv.org e-Print Archive

ResiDualGAN: Resize-Residual DualGAN for Cross-Domain Remote Sensing Images Semantic Segmentation

Author: Chen Xiuwan
Gao Han
Guo Peng
Sun Zihao
Zhao Yang
Publication venue
Publication date: 28/10/2022
Field of study

The performance of a semantic segmentation model for remote sensing (RS) images pretrained on an annotated dataset would greatly decrease when testing on another unannotated dataset because of the domain gap. Adversarial generative methods, e.g., DualGAN, are utilized for unpaired image-to-image translation to minimize the pixel-level domain gap, which is one of the common approaches for unsupervised domain adaptation (UDA). However, the existing image translation methods are facing two problems when performing RS images translation: 1) ignoring the scale discrepancy between two RS datasets which greatly affects the accuracy performance of scale-invariant objects, 2) ignoring the characteristic of real-to-real translation of RS images which brings an unstable factor for the training of the models. In this paper, ResiDualGAN is proposed for RS images translation, where an in-network resizer module is used for addressing the scale discrepancy of RS datasets, and a residual connection is used for strengthening the stability of real-to-real images translation and improving the performance in cross-domain semantic segmentation tasks. Combined with an output space adaptation method, the proposed method greatly improves the accuracy performance on common benchmarks, which demonstrates the superiority and reliability of ResiDuanGAN. At the end of the paper, a thorough discussion is also conducted to give a reasonable explanation for the improvement of ResiDualGAN. Our source code is available at https://github.com/miemieyanga/ResiDualGAN-DRDG

arXiv.org e-Print Archive